Attribute Weighting via Differential Evolution Algorithm for Attribute Weighted Naive Bayes (WNB)

نویسندگان

  • Jia WU
  • Zhihua CAI
چکیده

The naive Bayes (NB) is a popular classification technique for data mining and machine learning, which is based on the attribute independence assumption. Researchers have proposed out many effective methods to improve the performance of NB by lowering its primary weakness---the assumption that attributes are independent given the class, such as backwards sequential elimination method, lazy elimination method and so on. Recently, Mark Hall presents a simple filter method for setting attribute weights for naive Bayes and proposes a decision tree-based attribute weighted method. In his paper, the experimental results show that the new weighted method performs better than other weighted methods. That weighting idea is taken as the objective of our study in which we use differential evolution algorithms to determine the weights of attributes and then use these weights in our previously developed Weighted Naïve Bayes (WNB). We evaluate the performance of new configuration (DE-WNB) on the whole 36 standard UCI data sets in Weka system. We also compare it with the decision tree-based attribute weighted methods and other methods mentioned in Mark Hall’ paper for those data sets. Our experimental results show that the classification accuracy of our new algorithm DE-WNB is much higher than those of the other algorithms used to compare. The obtained classification accuracy is very good with respect to other common WNB classifiers in literature.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Alleviating naive Bayes attribute independence assumption by attribute weighting

Despite the simplicity of the Naive Bayes classifier, it has continued to perform well against more sophisticated newcomers and has remained, therefore, of great interest to the machine learning community. Of numerous approaches to refining the naive Bayes classifier, attribute weighting has received less attention than it warrants. Most approaches, perhaps influenced by attribute weighting in ...

متن کامل

Attribute Weighting via Differential Evolution for Attribute Weighted Clonal Selection Algorithm

Classification is an important technology in data mining, while clonal selection algorithm (CSA) is a very effective classification method. Although CSA brings a new effective tool for solving complex problems, we can not completely say that it over-performs to other algorithms especially in the classification field. A main problem of CSA classifier is that it does not carry attribute imbalance...

متن کامل

Attribute Value Weighted Average of One-Dependence Estimators

Of numerous proposals to improve the accuracy of naive Bayes by weakening its attribute independence assumption, semi-naive Bayesian classifiers which utilize one-dependence estimators (ODEs) have been shown to be able to approximate the ground-truth attribute dependencies; meanwhile, the probability estimation in ODEs is effective, thus leading to excellent performance. In previous studies, OD...

متن کامل

Analysis of sequential physiology data with weighted naive Bayes

In this project, I describe how I address the ICML 2004 Physiological Data Modeling Contest. For the gender prediction task, I compressed the large entry-based dataset to small session-based dataset and manually devised 90 features using a histogram method. Weighted naive Bayes (WNB) which is an extension of naive Bayes was applied and Markov Chain Monte Carlo was combined to solve the weight u...

متن کامل

Naive Bayes Spam Filtering Using Word Position Attributes

This paper explores the use of the naive Bayes classifier as the basis for personalized spam filters. Various machine learning algorithms, including variants of naive Bayes, have previously been used for this purpose, but the author’s implementation using word position based attribute vectors gives very good results when tested on several publicly available corpora. The effect of various forms ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011